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[57] ABSTRACT 

A method and apparatus is disclosed for improving the 
performance of a digital computer by reducing the 
latency of read operations and increasing available 
write bandwidth by utilizing a subset of the address bits 
which are the same from one operation to the next. A 
faster cycle type (e.g. page mode or static column) can 
thereby be employed in the Dynamic Random Access 
Memory (DRAM) memory by eliminating the DRAM 
precharge and RAS address portions of the cycle. 
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FIG. 3 is a block diagram of one embodiment of a 

METHOD AND APPARATUS FOR DRAM DRAM address and control unit according to the pres- 

MEMORY PERFORMANCE ENHANCEMENT ent invention; and 

FIG. 4 is a block diagram providing further detail of 

FIELD OF THE INVENTION 5 the command and address range elements showing se- 

_. ... quential address comparisons. 

The present mvent.on relates to computer memory ^^^^^^^ ^^^^.^^ ^^ ^^^ embodiment of a memory svs- 

systems, and m particular to cache memory systems. ^em is shown in copending application U.S. patent ap- 

BACKGROUND OF THE INVENTION plication Ser. No. 07/263,112 A FLOATING POINT 

10 PIPELINE PROCESSOR, filed Oct. 26, 1988, incor- 

High speed computer memories, in particular cache porated by reference, and U.S. patent application Ser. 

memory performance is strongly influenced by the No. 07/264,058, entitled AN EFFICIENT CACHE 

amount oftime necessary to fill the cache memory with WRITE TECHNIQUE THROUGH DEFERRED 

new data required by the processor. The sooner such a TAG MODIFICATION, filed concurrently herewith 

read operation can be started in main memory, the less 15 and incorporated by reference, 
time it will take to fill the cache. The influence of read 
latency on performance is derived from the read latency 
times the cache miss rate. This is the amount of time by 

which an average processor cycle will be increased due The following memory device read or write steps, 

to cache read latency. 20 cycles, states, signals and associated hardware, individ- 
ually, and in the prior art combination, are known to 

SUMMARY OF THE INVENTION those of ordinary skill in the art and further defined in 

The method and apparatus according to the present product specifications and application notes provided 

invention starts a read operation before a previously f^"" exemplary DRAMS 256kx4 100 ns static columns 

issued write operation, except that a read operation " DRAM manufactured by Toshiba TC514258J- 10, Mit- 

need only wait for write operations to complete if they S\?t<=u m^^^?/,; I'npf ' «^514258JP-10 and 

have an impact on the data to be obtained by the read t T • i„ Tw , „ .. . ^ 

.. 4-. f , . J I. J r In the pnor DRAM cycle illustrated m the state dia- 

operation. Therefore a read may be promoted ahead of m nrr^ ^ t^r^ • j j i /r 

'^ ■ . ■ 1 J iT gram 50 of FIG. 1. When a standard cycle starts (from 

any write operation which does not conflict. 3^ j^j^^ 5 ^^^ ^^^j^ ^^^ ^^^^^^^ f.^^ ^j. ^^^ ^^^_ 

The opportunity for promoting reads over writes ^^„^ j^ ^^^^^ „„ ^^^ j^^^^ J^^^^^ li„^3 ^„^ ^^^ 
occurs anywhere writes are buffered. In these places the ^RAM control lines, Row Address Select (RAS) Chip 
read operation address range is compared to older write gelect (CS) and write W are deasserted. The RAS con- 
operations stored m the buffer or queue. If the addresses trol line is asserted in state RAS 54, then, in state CS/W 
do not overlap, then the write can have no effect on the 35 the columns address is driven on the DRAM address 
result of the read and the read is promoted and issued ijnes and either CS (for a read cycle) or CS and W (for 
first. If the addresses do overlap, then the read waits for a write cycle) is asserted. The normal progress is then to 
the write to be issued. The address comparison may be state data available 58 (DAV) to latch in the data on a 
done sequentially by stepping through the queued write read cycle, or to assert CS to perform a write cycle. The 
commands, as in this implementation, or it may be done 40 cycle then completes by proceeding to state Pchg, 60 
in parallel by replicating the address comparison logic. which deasserts all control lines and then to Idle 52. 

In a read cycle, the cycles for a typical static column By contrast, the operation of the DRAM according 
DRAM are: (1) Row Address applied, (2) Row Address to the present invention is illustrated in the state draw- 
Strobe asserted, (3) Column Address applied, (4) Col- ing 70, FIG. 2 and associate control 100 of FIG. 3. The 
umn Strobe asserted, (5) Strobes deasserted for a pre- 45 operationofstates72, 74 and 76 correspond to states 52. 
charge time. If the Row address is the same from one 54 and 56 of FIG. 1. If, in DAV 78, we recognize the 
operation to the next, then phase 5 of the current cycle ^.^S address for the current cycle is the same as the 
and phases 1 and 2 of the next cycle (and 4 for static ^^^ address for the next command to be executed, then 
column devices) may be omitted thereby decreasing the ^« ^^'^ branch back to CS/W 76 either directly or 
amount oftime required for the next cycle. Identical 5° through state Ovlp 77. In the state Ovlp 77, both W. 123 
savings may be made with write cycles. Address com- ^"'^ ^^' \^^ '=°"*''°' ^'P^^^ ^«= deasserted to avoid 
parison logic is used to determine if the bits which select ""'"tended wnte operations to the DRAM(s) 130. Ovlp 
the set of DRAMS to be used and the Row Address 7 ""A^^ ,\"/ V""^ '^ the timing requirements of 
c. . J . . •. .. -rx.- UI-. . the DRAM cannot be met otherwise, but is aluass 
Strobe address bits are the same. This ability to use a ,, • j u .u . 1 • j j i. 
, _ , . - •' . 55 required when the current cycle is a read and the next 
shorter cycle increases memory performance and re- ^^^ .^ ^ ^^^^ ^^ ^^^^ ^j,^^^^ j^ ^^ j^^j^ ^ 

duces the amount of time it takes to clear wntes from between the assertion of CS for the previous cycle an J 

the command queues and also reduces the latency on ^ for the next cycle. The transition from state DAV 7g 

read operations. to CS/W 76 eliminates either two or three (depending 

BRIEF DESCRIPTION OF THE DRAWINGS ^ °" whether Ovlp is used) of the five states required for 

the standard cycle. 

These and other features of the present invention are Similar savings accrue for a read-modify-write cvcle 

better understood by reading the following detailed The standard (prior art) cycle performs a read cycle as 

description of the invention, taken together with the described above passing through states Idle 52. RAS 54. 

drawings, wherein: 65 CS/W 56 and RWl 62. Then the data read is modified 

FIG. 1 is a prior art DRAM control state diagrams; and written to the same address in states RW2 64 and 

FIG. 2 is a DRAM control state diagram according DAV 58. The cycle then completes by passing through 

to one embodiment of the present invention; Pcgh 60 and returning to Idle 52. 
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By contrast, the present invention provides that if, in 
state DAV 78, (having arrived through states 72, 74, 76, 
82 and 84 analogized to previously described in refer- 
ence to states 52, 54, 56, 62 and 64), the RAS address of 
the next command and the RAS address of the current 5 
cycle are equal, then the state machine passes to CS/W 
either through Ovlp 77 or directly 79. Therefore, the 
system and method of the present invention saves two 
or three (72, 74-and 80) of the seven cycles required by 
the standard cycle 50 of FIG. 1. 10 

The memory structure 100 of FIG. 3 provided com- 
mands for DRAM operations (read, write, read-modify- 
write) which are entered into the command queue (110). 
At each cycle, the command progresses through the 
queue if the next adjacent entry is empty. At each level 15 
a comparison is done in parallel (102, 104, 106, 108) 
between the RAS address field of each entry in the 
queue and the RAS address of the current cycle being 
executed by the DRAM control (120). The comparator 
output associated with the next command to be exe- 20 
cuted by the DRAM control is selected (112) and pres- 
ented to the DRAM control to affect the flow of its 
state machine. The DRAM control presents RAS, CS 
and W control signals 121, 122 and 123 and a multi- 
ple.xed RAS/CS address on leads 114 to the DRAM 25 
array (130) to control its operation according to the 
state shown in diagram 70 of FIG. 2. 



An embodiment of the control logic shown in FIG. 4 
selects the next read command to be executed using 
multiplexer 210 and sequences through all older write 
commands in the queue using multiplexer 211 compar- 
ing the address ranges using comparitor 212. If any 
older write conflicts in address, then the control logic 
waits until that write command is being executed to 
promote the read to be the next operation to be exe- 
cuted. Otherwise the read command is promoted imme- 
diately and presented to the DRAM control logic via 
multiplexer 209 when the current command being exe- 
cuted (if any) is no longer needed by the DRAM con- 
trol. 

Moreover, the present invention permits the merging 
of the data from the read operation of the memory with 
data which is to be updated by the pending write com- 
mands in order to allow pwomotion of reads over write 
operations which do conflict in address. 

Furthermore, the present invention when imple- 
mented can provide a sequence of commands to a pro- 
cessor permitting out-of-order execution of commands 
(including writes) in the main memory, optimized using 
these shorter cycles. For example, in the case of two 
operations which share a RAD address and are sepa- 
rated by other operation(s), the second operation could 
be e.xecuted out of order. 
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1. Memory Subsystem Summary 
1.1 Oveniew 

The main memor)' subsystem is a modular design capable of implementing 8 to 
256MB of ECC R\M on a single board using up to 4 pluggable daughierboard 
modules of 8. 16. 32 or, 64MB each. The DRAMs used are lOOns static column 
CMOS DR,\Ms with a 256K.x4 organization for the S and 16MB modules and a 
1M,\-I organization for the 32 and 64MB modules. 

The goal of the design is to suppon up to 4 CPUs as well as I/O transactions over 
the high bandwidth XBUS by minimizing traditional bottlenecks associated with the 
memory. It achieves this goal by using a high degree of parallelism even in the 
smaller memory configurations and by and intelligence in managing its write buffers 
and recognizing opponunitics for increasing memory bandwidth by using static col- 
umn cycles. 

1.1.1 Features 

• Multiple memory. boards may be used in a system to implement the required 
amount of memory. Each board uses only one backplane connector but, with 
its daughterboards, occupies rwo slots. Each board responds to unique ad- 
dresses for register accesses based on- the slot D and to a programmed ad- 
dress ranee for memory accesses. The only hardware limitations on the num- 
ber of memory boards is the available number of slots and the -4 bit slot ID. 

• Memory is partitioned into modules of 8MB to 64MB (see Figure lAi Each 
module is panitioned into 4 interleaved, independently controlled banks of 
RAM. Each bank has a data width of 32 bits! Module sizes may be mixed 
arbitrarily without any hardware restriction. However, for diagnostic purposes, 
it is most efficient to pair modules of the same size. 

• Each bank of memory implemenu ja error correcting code (ECC) on each 32 
bit data word allowing correction of single bit errors and detection of double 
bit errors (SEC-DED). No data is read during refresh cycles, so only normal 
operation e.xposes ECC errors in the memory. The memory does not rewrite 
locations with errors automatically, so software must rewrite a location with a 
soft error to correct the data in the DR\Ms. 

• Simultaneous single reads or writes arc permitted in independent banks of 
memory even within the same module. Successive commands to the same 
bank and page (actually RAS address) use static column mode to increase 
memory bandwidth. 

• Higher bandwidth is achieved for XBUS read or write multiple commands 
using bursts supponcd by static column RAMs. During a burst two 32 bit data 
words are delivered (or consumed) in each bus cycle. Bursts are limited to 
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1KB and may not cross a page (-1KB) boundary. Writes are accepted up to the 
available queue size if the tareet module is busy. 

• Command queues are included in the module and bank controllers to improve 
efficient use of the XBUS ancl the memory. Limited command reordering is 
used to reduce read latency. 

2. Memory Subsystem Partitioning 

Memory is partitioned into modules of 8. 16, 32, or 6-lMB. Each module is physi- 
cally built on a daughter board. Up to 4 daughterboards may be mounted on each 
mother board (see Figure l/^ The entire set of installed modules may be configured 
as a contiguous memory space regardless of what mix of daughterboard sizes arc 
used or which daughterboard sites arc populated. 

Each daughterboard is panitioned into 4 independent, interleaved banks. There arc 
two Memory Bank Controller (MBC) gate arrays on each daughterboard and so pairs 
of banks share an MBC. 
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This section pro\ides an overview of MMC functionality. Additional information on 
the detailed implementation of the MMC is available in the MMC Functional Specifi- 
cation and Implementation Description. 

Each motherboard has two MMC cate arrays to handle the XBUS protocol. Each 
MMC is logically capable of handling all 4 modules, but. for timing considerations, 
two MMCs arc used to split the load. Each MMC (see Figure 2^ contains a small 
command queue of 4 entries. If the queue is empty and the target MBC's queue can 
accept it. then the command bypasses the NtMC's queue. If. on the other hand, the 
module controller's queue is not 'empty or the target bank cannot accept it, then it is 
queued in the MMC. If the MMCs queue is full, the MMC issues a "busy" ac- 
knowledge to the .XBUS transaction. In the case of a Write-Multiple command, a 
"busy" is returned for the initial transaction if the queue is too full to store at least 
one of the accompanying write data commands. Othenvise it accepts as many write 
data commands as it can before issuing a •'busy'. While the MMC has commands 
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queued, it attempts to pass the oldest read command in front of write commands 
which do not conflict in address. This is intended to reduce the worst case read 
latency whenever possible. 

The module controller performs all XBUS protocol management. It decodes the 
command and address fields and determines whether a bus command has a destina- 
tion on the board and. if so, for which bank(s). Scan-path reset initialization con- 
nfieures the MNIC to respond only to an address derived from its own unique slot ID 
for CSR register references. Scan-initialized constants also determine the starting 
gddrcss of the Stwched modules and their sizes in 8MB increments. Multiple mod- 
ules on a board occupy contiguous memory locations. Any individual module may 
be disabled by programming its size to be 0. 

The module controller monitors the status of each of its MBCs and distributes com- 
mands to them when an MBC command queue is not full. This happens simultane- 
ously with receiving commands from the bus. so the queue performs as a FIFO if 
the target module becomes ready before the queue fills completely. 

The MMC also contains control and status registers for board level functions. If a 
bank controller repons an error, the module controller issues an interrupt to a soft- 
ware programmable address. .\n ECCC interrupt may be directed to a different 
address near the address for an ECCU. ECCU errors on reads are indicated on the 
XBUS with a Read-Response-Error in addition to the interrupt to prevent inadver- 
tent use of corrupt data by the reader. 

2.1.1 Power Up and Reset 

AJ] power up and reset initialization is performed by the scan-path hardware. Dur- 
ing scan-path usage. CS-before-R\S refresh is performed by external hardware to 
keep the DRA.Vf data alive. This backup refresh logic stans if it is. enabled and it 
detects that the normal clocks have stopped. It is stopped by a signal from the SCR. 
The scan logic. also loads the slot ED and module sizes into the MMCs. 

2.1.2 Module Controller Pin Description 

The Module Controller is a 13K gate array in a 256 pin PGA package. 2-i9 Pins are 
actually used. 

The Module Controller must connect to a large subset of the XBUS signals which 
accounts for 91 pins: 
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XBUS Command 
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XBUS Data 


Bidirectional 
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X_DATP(7:0) 
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Bidirectional 
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X_ID(3:0) 
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Bidirectional 
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XBUS Bus Request 
XBUS Bus Grant • 
XBUS Inhibit Arbitration 
XBUS Inhibit Arbitration 
XBUS Disable Input 
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Output 
Input 
Output 
Input 
Input 



Driver/Re.csiver Enables. Error Reporting and Diagnostic Pins account for 4: 
X_DE- XBUS Driver Enable Output 



X RE- 



XBUS Receiver Enable 



Output 



CLK_STP- 


Stop Clocks 


Output 




X_DIAG 


XBUS Diagnostic Mode 


Input 




Communication beiw 


:en the NIMC and the MBC requires 96 


signals: 




B_CMD(2:0) 


Bank Command 


Output 




b"dAT(63:0) 


Bank Data 


Bidirectional 


64 


B~ID(3:0) 


Read Response ID 


Input 




b'sUBID(1:0) 


■ Transaction Subid 


Bidirectional 




B~R£J 


Bank Command Reject 


Output 
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Bank Select 


Output 




bImS-(3:0) 


Module Select 


Output 




b~rr_go- 


Read Response Go 


Output 
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Input 
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Input 




B_RTLL02-(3:0) 


Even MBC Queue Full 


Input 




B_FULL13-(3:0) 


Odd MBC Queue Full 


Input 




Ri_REQ-(3:0) 
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Input 
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Input 




53 Clock, Scan Control, VCC and" ground pins: 
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2.2 Memoo'Modules 

Memory size is determined by the number and tjpe of memory modules installed. 
These modules are implemented as daughter cards which plug onto the mother- 
board. Up to 4 daughter cards may be installed on each motherboard. Each mod- 
ule (sec Figure 9A) incorporates 4 interleaved banks o( memory each 32 data bits 
wide (plus 8 ECC bits for sinele-error-correct/doublc-crror-detect) and the associ- 
ated bank controllers. The larger module uses cither 80 or 360 DR\Ms with either 
256Kx4 or ]Mx4 organization depending on module size. 

Since 160 DR^VIs arc required for each 16MB memoo' module, 20 pin SOJ surface 
mount packaging is usc'd to increase the packing density of the* modules. Double 
sided mounting is required to meet the density goal. 

DR^jMs. drivers and two NtBC gate arrays are installed on the daughter board to 
make a completely modular unit. A configuration register readable only by the 
service processor identifies the DRVvI size, module size, and DRAM vendor. 
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2.2.1 Daughterboard Connector and Plnout 

The pinout of the daughterboard connector is an image of the pinout of the NIBC. 
NIBC pins generally connect to the opposing pin on the connector to reduce the 
routing congestion on the daughterboard. Bus connections between the MBCs are 
made on the motherboard. 

In order to facilitate routing on the motherboard, a 2-10 pin, surface mount, com- 
, press ion s*jl» xonnector is used and only contact pads are required on both* the 
'motherboard and daughterboard. The daughterboard has contact pads on both sides 

of the board and mounts with either component side up or solder side up depending 

on the module location. The motherboard pads can then be directly bussed between 

the two module locations. 

2.2.2 Allowable Memory Configurations 

All combinations of module sizes and depopulations are permitted on the mother- 
board. 

2.3 Memory- Bank Controller (MBC) 

This section provides an' overview of MBC functionality. Additional information on 
the detailed implementation of the MBC is available in the MBC Functional Specifica- 
tion and Implementation Description. 

Each bank of memory contains every fourth 32 bit data word of the module. Two 
banks are controlled by each MBC to reduce cost, overall pin count and complexity. 
Longwords at byte offsets of and 8 are handled by one MBC and longuords at 
byte offsets 4 and 12 by the other. Error correcting codes are implemented in the 
bank controller which allow correction of single bit errors and detection of double 
bit errors. Sec Appendi.x B for the parity matrix of the code used. Normally 7 ECC 
check bits are used for SEC-DED coverage of 32 bits, but in this case it is aug- 
mented by one bit because 4-bit wide DRAMs are used. This augmented ECC 
allows detection of any 3 or 4 bit error that can occur due to a single DRAM failure. 
The error address register and syndrome arc contained in the MBC's CSR space. 

The bank controller (see Figure 4A)conirols the details of the RAM cycle including 
RAS, CS, multiplexed address and write enable as well as data path control and 
application of ECC. CS before RAS refresh is used.' 

2.3.1 Burst Mode Operation 

Static column burst mode is used as required for returning data requested by a Read 
Multiple command. During burst mode operation, pairs of banks arc cycled to- 
gether. The two banks in each MBC alternate in supplying data. Static column burst 
mode is also used in each bank independently to consume the data associated with a 
Write Multiple. Write Multiples may specify ascending or descending addresses to 
allow for buffer copy and stack write operations. If the Read Multiple transfer size 
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Figure 4A Memory Bank Coniroller IC 



is a power of two. it may be specified to Stan anywhere in the block and. at the end. 
wrap around modulo the block size. This function suppons cache fill operations 
where the faulting word is required first (or last), as well as natural order cache 
fills. 

The source of a Read command is assumed to be ready for its read responsc(s). 
■'Busy" and "error" acknowledges from the commander are ignored and no retry of 
the bus transfer is anempted by the memory. 

During a burst, refresh is suppressed and any accumulated refresh requests are 
processed at the end of the burst. The refresh request inter\'al has a 20Tc margin 
built in so that the maximum length burst does not violate the R.\M refresh specifi- 
cation. 
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2.3.2 Scan Mode 



Care must be used in entering and leaving scan mode if the contents of the DRAMs 
is to be preserved. RAS (Row Address Strobe) timing is particularly sensitive. If it 
is violated, then an entire row of cells may be corrupted in each affected DRAM. 
When the gate array clocks stop, external refresh circuitry normally takes over and 
keeps the DRAjM data alive. To restart the MBC, the state of the NIBC must be 
scanned out. modified to restart memory DRAM operations in progress, and. re- 
loaded. 

2.3.3 MBC Pin- Description 

The MBC gate array is a 30K gate array in a 256 pin PG.\. 251 pins are actually 
used. 

Communication between the MMC and the MBC requires 83 signals: 



B_CMD(2:0) 

B~DAT(63.0) 

B~ID(3:0) 

B_SUBID(1:0) 

B_REJ 

B^BSO- 

B~BS1- 

B_MS- 

B~RR_GO- 

b'eccc- 
b'eccu- 

B~FULL- 
Ri_REQ- 
B NACK- 



Baok Command 

Bank Data 

Read Response ID 

Transaction Subid 

Bank Command Reject 

Even Bank Select 

Odd Bank Select 

Module Select 

Read Response Go 

Correctable ECC Error 

Uncorrectable ECC Error ' 

MBC Queue Full 

Read Response Request 

Cancel Read Response Transfer 



Input 3 

Bidirectional 64 

Trislate 4 

Bidirectional 

Input 

Input 

Input 

Input 

Input 

Tristate 

Tristate 

Output 

Output 

Tristate 



Controlling two banks requires 114 pins: (NB: "'Even" and "Odd" below refer to 
quadword and 1 respectively, i.e. bytes 0-7 and 8-15) 



RAM_D0-(39:0) 

R.\M_Dl-(39:0) 

RAM~ADDR0(9:0) 

RAM~ADDR1(9:0) 

RASOO- 

RASOl- 

RASIO- 

RASll- 

CSOO- 

CSOl- 



Even Bank DK\M Data Bidirectional 

Odd Bank DRAM Data Bidirectional 

Even Bank Multiple.xed Address Output 

Odd Bank Multiplexed Address Output 

Even Bank Row Address StrobeO Tristate 

Even Bank Row Address Strobel Tristate 

Odd Bank Row Address StrobeO Tristate 

Odd Bank Row Address Strobel Tristate 

Even Bank Chip SelectO Tristate 

Even Bank Chip Select 1 Tristate 



40 
40 
10 
10 
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CSIO- 

CSll- 

OEO- 

OEl- 

WCBO- 

WCBl- 

WEO- 

WTl- 
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Odd Bank Chip SelectO 
Odd Bank Chip Selectl 
Even Bank Output Enable 
Odd Bank Output Enable 
Even Bank Checkbit Write Enable 
Even Bank Checkbit Write Enable 
Even Bank Write Enable 
Even Bank Write Enable 
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Tristate 




Tristate 




Output 




Output 




Output 




Output 


• '^ 


Output 




Output 





Configuration for Master/Slave, Evca'Oddlongword and diagnostic inputs take 3 
pins: 

B M.ASTER Master/Slave Configuration Input Input 1 

b"dIAG Diagnostic Mode Input ^ 

B~RC INH RAS/CS Inhibit Input 1 



Inter-NIBC communication between the Master and Slave .VBCs requires 8 pins: 



b_ecc_stall_d- 
bIeccIstall_r- 
bdcd_eb_rc-(1:0) 

BDCdIoB_RC-(1:0) 

bselJeven- 
bsel'slv 



ECC Error Stall Request Out 
ECC Error SuU Request In 
Even Bank Request Code 
Odd Bank Request Code 
Select Even Bank 
Select Slave MBC 



Output 

Input 

Bidirectional 

Bidirectional 

Bidirectional 

Bidirectional 



43 Clock, Scan Control, VCC and ground pins: 



CLOGK.\ 


clock 


CLOCKS 


clock ' ' 


CLOCKC 


clock 


CLOCKD 


clock 


A 


Scan Control 


B 


Scan Control 


C 


Scan Control 


D 


Scan Control 


DI 


Scan Control 


DO 


Scan Control 


E 


Scan Control 


GND 


Ground 


VCC 


+5 Volt Supply 



Input 




Input 




Input 




Input 




Input 




Input 




Input 




Input 




Input 




Output 




Input 




Input 


20 


Input 


12 
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3. Memory Control Logic Operation 
3.1 Bus Commands 

The XBUS spec allows the following bus commands: 

Write 

Write Multiple 

Write Data 

Read 

Read Multiple 
Read Response 
Read Response Error 

Broadcast Commands 
NOP 

The following sections examine each command in terms of the operation of the 
memory. Acknowledgements are sent two bus cycles after the command. The VPN 
field of XBUS commands is ignored by the memory. 

3.1.1 Invalid Commands 

The following commands are invalid if directed to the memory. They are not ac- 
knowledged and no action is performed by the memory: 

Read Response 

Read Response Error 

Broadcast Commands 

NOP 

3.1.2 XBUS Write Command " 

The valid b>ie bits arc examined by the MBC and. if the transaction is less than a 
longword write, a read-modify-write cycle is performed and ECC correction is ap- 
plied if required. Otherwise an early write cycle is used and no ECC checking is 
done. There is no indication of the completion of the DRAM cycle unless an ECCC 
or ECCU occurs on a read-modify-write cycle. In such a case, an interrupt is sent 
to the programmed address. In the case of an ECCU. the write is be aborted. 

Write order is always strictly preserved from the instruction stream's point of view. 
Implementation of write order within a single bank is strictly sequential, but writes 
to different banks may be performed out of order. 

3.1.3 XBUS Write Multiple Command 

Write Multiple is acknowledged with "busy'" if there is not at least one additional 
entry free in the module controller's queue for the write data command that follows. 
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If the target module is idle or becomes idle before the bank and module controller 
queues become full, then a burst write is initiated and the memory will be able to 
accept the maximum length transfer. Othenvise a "busy" acknowledgement is be 
sent when the NLMC is completely full. 

If the address of the write multiple is not quadword aligned, then the initial odd 
longword data is contained in the write multiple command. 

3.1.4 XBUS Write Data Command 

The write data" command is the data transfer ponion of a write multiple command. 
If the write multiple is accepted, then there is enough room in the queue for at least 
one write data command and it is queued and distributed to the bank controllers as 
they become ready. When one busy acknowledge is sent in a write multiple/write 
data sequence, all subsequent write data commands are ignored and no acknowl- 
edge is sent. A write data command without a preceeding write multiple command 
from the same commander is also ignored. The write data commands must have 
the same ID and SUBID field as the write multiple command or they are ignored 
and no further write data commands are accepted. The write multiple operation 
terminates with any command other than write data. 

3.1.5 XBUS Read and Read Multiple Commands 

Read commands have a data phase associated with them which uses the read re- 
sponse command. The XBUS is requested in the cycle prior to data becoming 
available to minimize latency. When the bus is granted, a read response is sent with 
the commander's ID and SUBID. Regardless of the address accessed, the data will 
be aligned on (he 64 bit data bus as it is in memory, i.e.. byte occupies bits 
<63:56> and byte 7 occupies bits <7:0>. Data not requested is not guaranteed to be 
valid. 

If the VIBC detects a correctable error in the data being transferred, it signals the 
NLVIC which replaces the read response command by a NOP. The MVIC holds the 
XBUS and inserts an additional NOP to allow the commander time to back up his 
cache fill. The data is corrected in the meanwhile and the corrected data is then 
returned in a read response. If there is time, e.g. due to delay in acquiring the 
XBUS, then the MBC corrects the data transparently with no interruption in the flow 
of data on the XBUS. In cither case an interrupt is then sent to the CPU pro- 
grammed in the MMC CSR. 

If an uncorrectable error occurs, the same sequence is used except that the two 
NOPs are followed by a read response error. A response to a read multiple contin- 
ues to completion despite the error. 

The memory will anempt to pass reads in front of writes which do not conflict in 
address. This will tend to reduce the worst case read and cache fill latency. 
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The bank controllers will also recognize when sequential commands use the same 

RAS address and will use static column mode -to shorten the cycle time and improve 

overall memory bandwidth. 

i. Memory Performance 

Some of the memory performance will be determined by the percentage of the time 
a request hits a busy bank. If the bank has been cycled recently, it must wait a 
precharee time (100ns.) before it can be cycled again. This does not apply to burst 
mode cycles using siatic colunjn accesses. 

4.1 Read Multiple 

4.1.1 Latency 



Bus Cycle 01 234 5 6789 10 

XBUS CMD ^^ , 



Module to Bank (RDV 



DR.\M Address ( R\S X CAS~X CAS ) 

Data from First Bank / \ / \ 
Pair to Module Controller ^ ' ^ ' 

BUSREQ / \ 



IMi.ARB.L ^1 

xBus-cMD (rryrrxrrxrr: 



32 Byte Read Response 



For a read multiple command to an idle memory, one bus cycle is used to transfer 
the command over the bus to the module controller. The second cycle is used to 
check the command, transfer the command to the appropriate bank controllers and 
acknowledge it. Three more cycles are required for the DRAM RAS access time. In 
the following cycle the read response is placed on the bus. A best-case total of 
300ns. Data availability is anticipated and the bus is requested in advance so that 
data availability and bus acquisition can coincide. 
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4.1.2 Bandwidth 

For a read multiple, a read response will be issued every 50ns. by ping-ponging 
between the two bank pairs on a module and using static column cycles. Since 8 
bytes are transferred on each read response, once the burst begins, a peak 
bandwidth of 160MB/sec is maintained. 

4.2 Write Multiple 

4.2.1 Latency 

* . - - • 

. . . • j» 

The latency for executing the first write multiple is similar to that of read multiple 

e.xcept we omit the read response phase. A best-case latency of 300ns.. but only 

the bus transfer is seen by the commander. 

4.2.2 Bandwidth 

Write data can be consumed at a rate of 8 bytes per 50ns. yielding a peak bandwidth 
of 160MB/sec. Ping-ponging and static column cycles are used similar to the case 
of the read command. 

4.3 Read Single 

4.3.1 Latency 

The latency for a read single will be the same as for a read multiple. Read single 
commands should be rare for accesses to memory e.xcept for lock variables. 

4.3.2 Bandwidth 

For a read. a. read response of up to 4 bytes can be issued every 50ns. from inde- 
pendent banks. -The best case bandwidth of 80NtB/sec. (longword read) could be 
sustained. if there were no "hits in a busy bank (and therefore no precharge time 
penalty). - 

4.4 Write Single 

4.4.1 Latency 

A write single for a longword will have the same latency as a write multiple 
(300ns.). If the write is smaller than a longword, a read-modify-write cycle must 
be performed with about a 150ns longer cycle time. 

4.4.2 Bandwidth 

A write single of 1 to 3 bytes takes 7 clocks to e.xecute with precharge time, but a 
loneword write single takes 5 clocks. If a subsequent write single is issued to the 
same K\S address in the same bank, then a 1 to 3 byte write takes only 5 additional 
clocks and a longword write takes only 3 additional clocks. 
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The resulting bandwidth ranges from 2.9 to 26.7 MB/scc. for a single bank. The 
independant banks operate in parallel and therefore increase this range linearly. 

5. Control and Status Registers 

Each memory board has a set of control and status registers, including an error 
registers and configuration register. Each board's registers stan at a bese address 
that is determined by the slot ID as specified in the XBUS specification. AJl regis- 
ters are byte writeable. 

The module controller contains the board's registers at the beginning of the board's 
CSR space. The register offsets from the base address are as follows: 

Interrupt Address Register 
Control/Status Register 

Bank registers occupy the range of offsets from S20-S3F. Each bank's registers 
occupy 8 bytes each: 

Bank Control/Status Register +0 
Bank Error Address Register +4 

The upper 128KB (offsets S20000-S3FFFF) of the memory board's CSR space is 
used for diagnostic access to the ECC check bits. Each byie contains the check bits 
for one data longword and so the 12SKB window covers the check bits for 512KB of 
data memory. 

Use of read multiple to access the memory board's CSR address space is prohibited. 

5.1 MMC Control and Status Registers 

The CSR block is the location for ell software visible state in the MMC. It also 
contains the multiple.xer between read response data from the MBCs and from the 
NtMCCSR. Addressing is described in the XBUS Interface. CSR decode section. 

When the CSR has an XBUS transaction to send, it requests the Bank Interface to 
acquire the XBUS. When it docs, and when it enables the CSR, the CSR inscns its 
interrupt or read response and releases its request. 

5.1.1 Interrupt Address Register [ +0/+8 ] 

This longword contains the 24 bit address of the location to be written when for an 
ECCU interrupt. It also contains an 6 bit field which, for an ECCC interrupt, is 
substituted for the least significant 6 bits of the longword address of the ECCU 
interrupt location. 

16 2J :4 2J ;: 21 u 1: o« 0: 0: oj 00 




^H - Not .■'iffecied by NS'ritc 
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5.1.2 Control and Status Register [ +4/+12 ] 

This recister contains summary error information and control for the board. 
31 30 :9 24 :3 :: :i i6 i5 m os o? o6 o3 



00 



U C 



:ece 



w 



CB Offset 



WM 



U = ECCU Laiched Sliius - \Vriie "1" lo clear 
C = ECCC Latched Siaius - \^nte "1" lo clear 
UE = ECCU Iniernjpi EnaMe 
CE = ECCC Inieirupi E.-.ablc 



W I Check Bil WinJow Enable - 
WM = Check Bil Window Module 



D 



Reserved; Write wnh "0"s 



Brief Explanation of CSR bits 

Latched Status: Set whenever a bank reports the corresponding error. Read/ 
Write I to clear. 

Interrupt Enables: If the Latched Status Bit is set and the corresponding Inter- 
rupt Enable is set, then a write to the appropriate interrupt location is 
scheduled. When the write is sent, the interrupt enable is cleared. 

Check Bit Window Enable: Enables the check bit window. 

Check Bit Window Offset: Offsets the data locations covered by the check bit 
window in steps of 128KB. 

Check Bit V^'indow Module: Module to read for check bit window accesses. 

5.1.3 Interrupt Senice 

The anticipated sequence for servicing a memory interrupt is: 

1. Read each MMC's CSR's Latched Status bits and see which is(are) inter- 
rupting. 

2. Clear the interrupting MMC CSR's Latched Status bits. 

3. Poll the anached MBCs to see which have errors latched, rewrite the loca- 
tion with the error to remove the error, and clear the MBCs latched error 
bits. - 

4. Reenablc the interrupts. 

5. Reread the interrupting MMC CSR's Latched Status bits to see if another 
error has occurred during the service routine. 

5.2 MBC Control and Status Registers 

There are two registers associated with a Bank Controller, the Control and Status 
Register (CSR) and the Error Address Register (E-VR). Each set of registers is 
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shared between iwo bank controllers to economize on hardware. Banks and 2 

share one set, and banks 1 and 3 share the other. The CSRs are only writeable 

through banks and 1. but are readable through either address. 

5.2.1 Bank Controller Control and Status Register 

Each bank controller contains a CSR to control diagnostic and ECC modes. 

M- 30 29 :S 27 16 IJ 14 13 13 13 10 09 05 0^ 



00 



E 


U M 


C 




Error ID 


SUB- D 


\\ 


R 


L 


F 




^g|; B:i'hdntnti7:Qys\^}.- 



E - Error Laiehed (Wril* "l" lo clear) 

U s ECCU 

.M = Multiple ECCUs 

C - Oiagnoiilic Compare Error 



D = Diajnosiic .Mode 

W I Diagfloslic Write .Ml Quadiaots 

R = Raw Mode (ECC Disable on Read) 

L = Lock Compare Rejisier 

F = Freeze Check Biis 



- - Not Affected bv Write 



□ 



- Reserved; Write with "0"s 



Brief E.xplanation of CSR Bits 

E Error Latched. Whenever any ECC or Diagnostic Compare Error is en- 
countered, this bit is set to indicate that the Error Address Register and 
the read-only fields of the CSR are valid. Write a T' to clear' this flag. 

U Set if latched error is an ECCU. If not set then the latched error is an 
ECCC or Diagnostic Compare Error. If an ECCU occurs while either an 
ECCC or a Diagnostic Compare Error is latched, the ECCU information 
will overwrite the latched information. Read Only. 

M Set if multiple ECCU errors have occurred. The latched information is 
for the first ECCU that occurred. Read Only. 

C Set if a Diagnostic Compare Error has occurred. This error is only en- 
abled if the D bit or the L bit is set. 

D Diagnostic Compare Mode. Operation of diagnostic mode requires one 
master module to have the bit reset. Ail modules which have this bit set 
will be slaved xo the master. Slaves will accept all write and read com- 
mand regardless of module destination. No data will be presented by the 
slave to the NIMC. but instead it will compare the internal data to the 
Master's data. Mismatches will result in a diagnostic compare error. 

W Diagnostic Write All Quadrants Mode. Normally a write to the RAM 
array of a bank only affects 10 of the 40 DRAMs. The 10 are selected by 
the RAS and CS select bits (see the EAR description below). When the W 
bit is set, the RS and CS bits of the address are ignored and all 40 
DRAMs are v.ritten. This reduces the time to write the RAM array by a 
factor of 4. 

R Raw Mode. fcCC checking is disabled on read and read-modify-write 
cycles. Correct check bits are generated on write and read-modify-write 
cvcles. 
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L Lock Compare Register. The compare register, used in diagnostic mode 

above, can be frozen to check blocks of Master or single board memory. 

With this bit reset, a read will load the compare register. Setting the bit 

will lock the register and any subsequent read uith different data will 

cause a diagoostic^ompare error. 

. F Freeze Check Bits. Used for creating ECC errors in memory. When set. 
write or read-modify-write operations cause data to be updated in mem- 
ory, but not checkbits.. Whatever checkbits exist in a locatjon remain 
— unchanged. 

EiTor ID, SUBID 

The slot ID and command SUBED of the XBUS device which issued the 
failing command. 

Syndrome 

8 Bit Syndrome of the ECC error. 

5.2.2 Error Address Register (EAR) 

The Error Address Register is read-only. It is frozen at the same time the CSR on 
ECC errors. Only 22 bits of doublc-quadword address are captured. The high 
order 4 bits must be obtained for each module from the configuration table supplied 
by the Service Processor at boot time-. Mapping of physical address to DR,\M ad- 
• dress is shown for diagnostic purposes. 



31 30 29 



Zi 25 



M t :a:fi;jS*-J;s ^Kiri;::;, 



25 24 23 22-21 



c:^C9[rj 



RASiddKS-O) 



04 03 O: 01 00 




:.Mr:CAS Addr(8:(».i V; 



CS » C.\S0'1 Stltci (4Mb DRAMi) 
RS » RASO.'l Seleci (4Mb DRAMs) 
C9 I CAS AJdr(9) (4Mb DRA.Msl 

CASO'l S<l«cl (.l.Mb DRA.Ms) 
W » RAS .\dill(9) (4Mb DRAMil 

R.\S0'1 S<ltcl (1Mb DRA.Mj) 



S I Sparc ueaOi a Ol 

B - Bani Rtponun Enor 

.M » Read-Modilj-W'riie C>clt 

I « CAS .\dJtii was iacremtniti 



- Not .Affected by NS'rite 



Brief Explanation :f EAR Status Bits 

M Latched i;.formation is for a Read-Modify-Write cycle, i.e., a write of 
less than -• bytes. 

I Set if the latched CS address has been advanced 16 bytes beyond the 
failing location. This may happen on read multiple operations since the 
addre'ss is pipelined. 
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B Set to indicate which bank has reported the error. 

NB; since banks and 2 share one register and banks 1 and 3 share 
another, the least significant bit is scan-initialized and static, not dy- 
namic. 

6. Error Handling 

6.1 CorrectabTe ErroFS 

A correctable error may occur whenever data is read from the DRAM's. This can 
happen during, a read response or a write request which is -less than a longword. 

On a read response, the read response is changed to a sequence of two NOPs, the 
data is corrected and supplied in the subsequent bus cycle. An error indication is 
flagged to the module controller which will, in turn, cause an interrupt by writing to 
the address specified in the interrupt address register. The error information is 
frozen in the bank CSR and error address register. Until the ECCC bit is turned off 
in the bank CSR. any subsequent single bit error will be corrected, but will not be 
reponed. 

On a write request of less than a longword. the data is corrected and merged with 
the b>ics to be written. The new longword is written into memory. The status regis- 
ter is frozen and the interrupt is generated. Any subsequent errors that occur while 
the interrupt is pending are treated the same as the read case. 

6.2 Uncorrectable Errors 

An uncorrectable error may occur under the same circumstances as a correctable 
error. 

Uncorrectable errors are handled in the same way a correctable errors e.xcept that 
corrected data cannot be supplied, so the read response is terminated by issuing a 
read rcsj>onse error command after the iwo Js'OPs. Write cycles encountering uncor- 
rectable errors are aboncd without the write being performed. Information on the 
error is loaded into the MBC CSR and error address registers and frozen. An 
interrupt is generated by the NIMC to the address specified in the interrupt address 
register. 

While the register has the uncorrectable error information latched, any subsequent 
correctable errors will be corrected but no information will be logged in the status 
register. If an uncorrectable error occurs, rto information is logged on the second 
ECCU e.xcept the Multiple Errors bit will be set. 

If an uncorrectable error occurs while the register has information latched for a 
correctable error, the ECCU Error bit is set in the CSR and the ECCU information 
overwrites the ECCC data. This is the only sequence of events that will set both the 
ECCC Error and ECCU Error bits in the CSR and the information in the CSR will 
pertain to the ECCU Error. 
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7. Diagnostic Features 

7.1 Diagnostic Mode 

When a module is in diagnostic mode, it becomes a slave to the other module at- 
tached to the MMC. Operations aire performed in parallel with the master and any 
-read rrsponse data issued by the master is latched and checked against the data that 
the slave would have supplied. If the data does not match, the slave generates an 
ECCU indication to the MMC and logs the mismatch in its CSR. 

7.2 Write AU Quadrants 

The DRAMs associated with a bank are divided into four quadrants addressed by 
two RAS and two CS lines. Normally operations (e.xcept for refresh) are performed 
on only 1 of the quadrants at a time. When an MBC is in write all quadrants mode, 
ail four quadrants arc written simultaneously. 

7.3 Raw Mode 

In raw mode. ECC errors will not be checked or reponed. This provides a mecha- 
nism for reading data from a location with an uncorrectable error. 

7.4 Lock Compare Register 

The compare register used by the slave in diagnostic mode can be enabled by this 
mechanism in any module. Once locked, the data from the previous read response 
stays in the register. If any subsequent read response mismatches, the compare 
error flag is set in the MBC CSR and an ECCU error is reported to the MMC. 

7.5 Freeze Check Bits 

When this bit is set, writes to the DRWls do not update the checkbits. This pro- 
vides a mechanism for creating ECCC and ECCU errors, in the memory. 

7.6 Reading the Check Bit R\Ms 

The check bit window is used to read the check bit RAMs. The offset of the window 
must be set in the M>iC CSR and the window enabled first. Read commands are 
allowed up to a longword in length, but read multiple is not allowed in the check bit 
window. Each successive byte in the checkbit window corresponds to the check bits 
for the corresponding data longword. ECC errors are not be checked on reads 
through the window. 

8. Reliability 

The MTBF goal for the memory subsystem should be well in e.xcess of 10,000 
hours. Note that the minimum acceptable system MTBF should be well in e.scess of 
2,000 hours. 
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See Appendix A for subsystem MTBF estimate calculations. The calculations esti- 
mate that the MTBF of each motherboard is in e.xcess of 52,000 hours, so. even with 
three motherboards installed, a memory system MTBF of 17.000 hours may be esti- 
mated. 

8.1 MOS DR\M MTBF 

The MOS RVvl's are static column. 100ns access time, with 256K.\-} and 3Mx4 
organizations. The hard failure rate is estimated to be < 150 FIT (Failures In Time) 
(0.15 hard failures per million device hours). The estimated soft failure rate is < 300 
FTT. 

9. Power 

Only 1/4 of the DRAMs in each bank can cycle simultaneously (except for diagnos- 
tic mode). Each DRAM is capable of dissipating 300mW when active and 20mW in 
standby. A continuously active module has 40 DRAMs active which dissipate 12W. 
Idle modules dissipate about 4W. If we assume that only two modules can be kept 
continuously active then the DRAM dissipation is 32W for a fully populated 
motherboard. The gate arrays, CMOS DRAM address and RAS/CS drivers should 
account for no more than another 15W. If we allot another 5W for the XBUS 
drivers, the total for a fully populated motherboard is about 52W. 
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Appendix A 

A. DRAM MTBF Calculations 

The system MTBF goal should be well above 2000 hours. Therefore all major 
subassemblies should have MTBF values well above that. In panicular the memory 
subsystem should have an MTBF of at about 10.000 - 20,000 hours. The number 
presented below are for a single 64MB memory motherboard. If the system uses 
12SMB then the failure rates are doubled. 

A.l RAM FaUure' Model 

Soft errors are modeled as a Poisson process. The probability density function for 
the occurrence of a "n" errors is 

p(n.t) = \"t"-' e-^' 

where X is the failure rate. The probability function is simply the integral of the 
probability density function. 

The estimated maximum soft error rate ( \ $) is 0.3 errors per million device hours 
(300 FTT). A fully configured memory board of 64MB will have 640 DRAM devices 
on it. The overall soft error rate for the board without error correction is therefore 
192 errors per million device hours or an MTBF of about 5200 hours. This would be 
unacceptable without error correcting codes. 

The estimated maximum hard error rate ( \ h) is 0.15 errors per million device 
hours. The overall hard error rate for a 64MB board would be 96 errors per million 
hours or an MTBF of about 10,400 hours. Note that there are several ^ard error 
mechanisms: complete failure of a chip, failure of a row. failure of a column and 
failure of z single cell. The calculations based on hard errors assume that it is 
always a complete failure of a chip and so are more pessimistic than reality. 

A.2 Effect of Error Correcting Codes on Soft Error Rate 

Use of a SECDED (single error correct, double error detect) code means that only 
multiple bit errors in the same longword produce system failures. Even if soft 
errors are not corrected in memory and we assume that the errors are distributed 
uniformly across the memory, the probability that two soft errors will occur in the 
same longword is 

P(ECCU)=(X,,— ^(X.,-l^-^)'*'^ 



10'64MB 10*64NtB^ 2-4 



So. to reach even a probablity of 10^. takes about 1800 times the single bit error 
MBTF, or about 9 million hours. 
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A.3 R\M Hard Errors 

256Kx4 DRAMs arc used in this memory. The most prevalent failure modes affect 
only a single cell, but perhaps 20% of the hard errors in a 256Ka4 DRAM affect 
more than one output. These errors cause ECCU errors with no warning and no 
opportunity for page retirement. With the total hard error MTBF of 10,400 hours, 
the MTBF for a hard error induced ECCU is, then, approximately 52.000. This is 
an acceptable per motherboard MTBF. 

Note that normally the failure of an entire 256K.\4 DRAM could cause 3 and 4 bit 
errors. The standard seven check bits would not guarantee detecting these errors 
and they could be interpreted as no error or a correctable, single-bit error. The 
addition of an eighth check bit allows the guarantee that any 3 or 4 bit error as the 
result of a single DRAM failure will be correctly identified as an ECCU. 
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B. ECC Parity Algorithm 
B.l ECC Parity Matrix 

Data bit Check word bit 

6 5 4 3 2 
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B.2 256Kx4 DRAM Bit to Device Assignment 



B.2.1 Data Bit 


Assignment 




Data bit 


Device 


Bit 
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B.2.2 Check Bit Assignment 

Bit 7 is assigned as an additional parity bit on the check bits with 256K.\4 DRAVts: 
Check bit Device Bit 

7 9 3. 

6 8 

5 8 1 

4 8 2 

3 8 3 
3 -90 

1 9 1 

9 2 
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What is claimed is: 

1. A fast memory system, comprising 

a static column dynamic random access memory 
(DRAM); 

a command queue for receiving memory commands ^ 
and providing DRAM address signals in response 
to said memory commands; 

a RAM control means for receiving said DRAM 
address signals and issuing DRAM control signals 
to said DRAM; and '0 

address compare means for comparing successive 
DRAM address signals and providing a RAM con- 
trol signal to said RAM control mans when a select 
portion of said DRAM address signals are equal, 
wherein '^ 

the RAM control signal is asserted for a new com- 
mand without providing a different subsequent 
row address signal. 

2. The fast memory system of claim 1, wherein said ,f, 
RAM control means further includes means for deas- 
serting the write (W) and chip select (CS) before said 
write (W) and chip select (CS) signals are reasserted for 

a subsequent write command having a row address 
corresponding to the row address of the previous com- 25 
mand. 

3. A method of high speed static column dynamic 
random access memory data transfer, comprising the 
steps of: 

issuing a first read command for writing data into said 30 
DRAM for data having a first address comprising 
a plurality of signals; 

issuing a second read command for reading data from 
said DRAM for data having a second address com- 
prising a plurality of signals; and 35 

comparing said first and second address; 

starting said second read operation associated with 
said read command before said first read operation 
associated with said write command when the data 
provided by said DRAM in response to said read 40 
operation does not affect said first read operation 
and when said first and second addresses share at 
least a common portion of said address signals. 

4. A method a high speed static column dynamic 
random access memory data transfer, comprising the 45 
steps of: 

entering a plurality of commands into a queue includ- 
ing a respective plurality of queue locations, said 
plurality of commands each including a row ad- 
dress select field; ^^ 

advancing each of said plurality of commands from 
each of said respective plurality of queue locations 
to a respective plurality of next queue locations and 
toward a next command to be executed location, 
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for subsequent execution by a memory control 
means including a state machine; 

comparing said row address select field of a current 
command being e.xecuted, with said row address 
select field of a next command to be executed to 
generate a result; 

outputting said result to said memory control means; 

affecting a sequence of execution of said stale ma- 
chine in accordance with said result; and 

performing memory operations in accordance with 
control signals generated by said memory control 
means in response to said sequence of execution of 
said state machine. 

5. The method of claim 4 further including the step 
of: 

multiplexing a plurality of addresses to said dynamic 
random access memory in accordance with said 
sequence of execution of said state machine. 

6. The method of claim 4 wherein said plurality of 
commands include read, write and read-modify-write 
commands. 

7. The method of claim 4 further including the steps 
of: 

selecting a read command in said queue that is closest 
to said next command to be executed location; 

sequencing through a plurality of write commands in 
said queue and comparing said row address select 
field of each of said plurality of write commands 
with said row address select field of said read com- 
mand; 

waiting until a write command which has been in said 
queue longer than said read command and having 
the same row address select field as said read com- 
mand, is executed; 

promoting said read command to said next command 
to executed location when there is a current com- 
mand being executed; and 

promoting said read command to said current com- 
mand being executed, 

8. The method of claim 4 wherein said step of affect- 
ing a sequence of execution of said state machine in- 
volves branching back in said sequence to a column 
address select state when data is available to perform a 
memory operation and said row address select field of 
said current command being executed is the same as said 
row address select field of said next command to be 
executed. 

9. The method of claim 8 wherein a chip select con- 
trol signal and a write command control signal are deas- 
serted prior to branching back in said sequence to said 
column address select state, when said current com- 
mand being executed is a read command and said next 
command to be e.xecuted is a write command. 
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